Basic Statistics

Raw Counts

Name Value
Rows 307,511
Columns 122
Discrete columns 16
Continuous columns 106
All missing columns 0
Missing observations 9,152,465
Complete Rows 8,602
Total observations 37,516,342
Memory allocation 286.3 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 1 columns ignored with more than 50 categories.
## ORGANIZATION_TYPE: 58 categories

QQ Plot

## Warning: Removed 1 rows containing non-finite values (stat_qq).
## Warning: Removed 1 rows containing non-finite values (stat_qq_line).

## Warning: Removed 673 rows containing non-finite values (stat_qq).
## Warning: Removed 673 rows containing non-finite values (stat_qq_line).

## Warning: Removed 2948 rows containing non-finite values (stat_qq).
## Warning: Removed 2948 rows containing non-finite values (stat_qq_line).

## Warning: Removed 5226 rows containing non-finite values (stat_qq).
## Warning: Removed 5226 rows containing non-finite values (stat_qq_line).

## Warning: Removed 4868 rows containing non-finite values (stat_qq).
## Warning: Removed 4868 rows containing non-finite values (stat_qq_line).

## Warning: Removed 5136 rows containing non-finite values (stat_qq).
## Warning: Removed 5136 rows containing non-finite values (stat_qq_line).

## Warning: Removed 5196 rows containing non-finite values (stat_qq).
## Warning: Removed 5196 rows containing non-finite values (stat_qq_line).

## Warning: Removed 1710 rows containing non-finite values (stat_qq).
## Warning: Removed 1710 rows containing non-finite values (stat_qq_line).

## Warning: Removed 816 rows containing non-finite values (stat_qq).
## Warning: Removed 816 rows containing non-finite values (stat_qq_line).

Correlation Analysis

## 1 features with more than 20 categories ignored!
## ORGANIZATION_TYPE: 55 categories
## Warning in cor(x = structure(list(SK_ID_CURR = c(100083, 100145, 100179, : the standard deviation is zero

Principal Component Analysis

## 1 features with more than 50 categories ignored!
## ORGANIZATION_TYPE: 55 categories
## Warning in plot_prcomp(data = structure(list(SK_ID_CURR = c(100083, 100145, : The following features are dropped due to zero variance:
##  * FLAG_MOBIL
##  * FLAG_DOCUMENT_2
##  * FLAG_DOCUMENT_4
##  * FLAG_OWN_CAR_Y